Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 1108 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 190.6 KiB |
| Average record size in memory | 176.1 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 12 |
Dt_Customer has a high cardinality: 536 distinct values | High cardinality |
Income is highly correlated with Kidhome and 5 other fields | High correlation |
Kidhome is highly correlated with Income and 3 other fields | High correlation |
NumWebPurchases is highly correlated with Income and 3 other fields | High correlation |
NumCatalogPurchases is highly correlated with Income and 5 other fields | High correlation |
NumStorePurchases is highly correlated with Income and 4 other fields | High correlation |
NumWebVisitsMonth is highly correlated with Income and 1 other fields | High correlation |
target is highly correlated with Income and 4 other fields | High correlation |
Income is highly correlated with NumCatalogPurchases and 3 other fields | High correlation |
Kidhome is highly correlated with NumCatalogPurchases and 1 other fields | High correlation |
NumWebPurchases is highly correlated with NumStorePurchases and 1 other fields | High correlation |
NumCatalogPurchases is highly correlated with Income and 4 other fields | High correlation |
NumStorePurchases is highly correlated with Income and 3 other fields | High correlation |
NumWebVisitsMonth is highly correlated with Income and 1 other fields | High correlation |
target is highly correlated with Income and 4 other fields | High correlation |
Income is highly correlated with NumCatalogPurchases and 2 other fields | High correlation |
Kidhome is highly correlated with NumCatalogPurchases | High correlation |
NumWebPurchases is highly correlated with NumStorePurchases and 1 other fields | High correlation |
NumCatalogPurchases is highly correlated with Income and 3 other fields | High correlation |
NumStorePurchases is highly correlated with Income and 3 other fields | High correlation |
target is highly correlated with Income and 3 other fields | High correlation |
Income is highly correlated with Kidhome and 7 other fields | High correlation |
Kidhome is highly correlated with Income and 4 other fields | High correlation |
Teenhome is highly correlated with NumDealsPurchases | High correlation |
NumDealsPurchases is highly correlated with Teenhome and 1 other fields | High correlation |
NumWebPurchases is highly correlated with Income and 2 other fields | High correlation |
NumCatalogPurchases is highly correlated with Income and 3 other fields | High correlation |
NumStorePurchases is highly correlated with Income and 5 other fields | High correlation |
NumWebVisitsMonth is highly correlated with Income and 3 other fields | High correlation |
AcceptedCmp5 is highly correlated with Income and 2 other fields | High correlation |
AcceptedCmp1 is highly correlated with Income and 1 other fields | High correlation |
target is highly correlated with Income and 5 other fields | High correlation |
id is uniformly distributed | Uniform |
Dt_Customer is uniformly distributed | Uniform |
id has unique values | Unique |
Recency has 15 (1.4%) zeros | Zeros |
NumDealsPurchases has 27 (2.4%) zeros | Zeros |
NumWebPurchases has 26 (2.3%) zeros | Zeros |
NumCatalogPurchases has 279 (25.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-02 13:53:35.724126 |
|---|---|
| Analysis finished | 2022-05-02 13:53:42.321675 |
| Duration | 6.6 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 1108 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 553.5 |
| Minimum | 0 |
|---|---|
| Maximum | 1107 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 55.35 |
| Q1 | 276.75 |
| median | 553.5 |
| Q3 | 830.25 |
| 95-th percentile | 1051.65 |
| Maximum | 1107 |
| Range | 1107 |
| Interquartile range (IQR) | 553.5 |
Descriptive statistics
| Standard deviation | 319.9963541 |
|---|---|
| Coefficient of variation (CV) | 0.5781325278 |
| Kurtosis | -1.2 |
| Mean | 553.5 |
| Median Absolute Deviation (MAD) | 277 |
| Skewness | 0 |
| Sum | 613278 |
| Variance | 102397.6667 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 737 | 1 | 0.1% |
| 743 | 1 | 0.1% |
| 742 | 1 | 0.1% |
| 741 | 1 | 0.1% |
| 740 | 1 | 0.1% |
| 739 | 1 | 0.1% |
| 738 | 1 | 0.1% |
| 736 | 1 | 0.1% |
| 728 | 1 | 0.1% |
| Other values (1098) | 1098 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 1107 | 1 | |
| 1106 | 1 | |
| 1105 | 1 | |
| 1104 | 1 | |
| 1103 | 1 | |
| 1102 | 1 | |
| 1101 | 1 | |
| 1100 | 1 | |
| 1099 | 1 | |
| 1098 | 1 |
Year_Birth
Real number (ℝ≥0)
| Distinct | 57 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1968.701264 |
| Minimum | 1893 |
|---|---|
| Maximum | 1996 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 1893 |
|---|---|
| 5-th percentile | 1949 |
| Q1 | 1959 |
| median | 1970 |
| Q3 | 1977 |
| 95-th percentile | 1988 |
| Maximum | 1996 |
| Range | 103 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 12.22537967 |
|---|---|
| Coefficient of variation (CV) | 0.006209870383 |
| Kurtosis | 1.18745119 |
| Mean | 1968.701264 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.439100387 |
| Sum | 2181321 |
| Variance | 149.4599081 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1971 | 46 | 4.2% |
| 1976 | 42 | 3.8% |
| 1970 | 42 | 3.8% |
| 1973 | 41 | 3.7% |
| 1975 | 39 | 3.5% |
| 1978 | 38 | 3.4% |
| 1969 | 36 | 3.2% |
| 1965 | 35 | 3.2% |
| 1972 | 34 | 3.1% |
| 1958 | 31 | 2.8% |
| Other values (47) | 724 |
| Value | Count | Frequency (%) |
| 1893 | 1 | 0.1% |
| 1900 | 1 | 0.1% |
| 1940 | 1 | 0.1% |
| 1941 | 1 | 0.1% |
| 1943 | 5 | |
| 1944 | 4 | 0.4% |
| 1945 | 5 | |
| 1946 | 10 | |
| 1947 | 8 | |
| 1948 | 12 |
| Value | Count | Frequency (%) |
| 1996 | 1 | 0.1% |
| 1995 | 3 | 0.3% |
| 1993 | 3 | 0.3% |
| 1992 | 6 | 0.5% |
| 1991 | 4 | 0.4% |
| 1990 | 9 | 0.8% |
| 1989 | 21 | |
| 1988 | 15 | |
| 1987 | 18 | |
| 1986 | 26 |
Education
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| Graduation | |
|---|---|
| PhD | |
| Master | |
| 2n Cycle | |
| Basic | 22 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 7.510830325 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8322 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Master |
|---|---|
| 2nd row | Graduation |
| 3rd row | Graduation |
| 4th row | Basic |
| 5th row | PhD |
Common Values
| Value | Count | Frequency (%) |
| Graduation | 570 | |
| PhD | 254 | |
| Master | 173 | 15.6% |
| 2n Cycle | 89 | 8.0% |
| Basic | 22 | 2.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| graduation | 570 | |
| phd | 254 | |
| master | 173 | 14.5% |
| 2n | 89 | 7.4% |
| cycle | 89 | 7.4% |
| basic | 22 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1335 | |
| r | 743 | |
| t | 743 | |
| n | 659 | 7.9% |
| i | 592 | 7.1% |
| G | 570 | 6.8% |
| d | 570 | 6.8% |
| u | 570 | 6.8% |
| o | 570 | 6.8% |
| e | 262 | 3.1% |
| Other values (12) | 1708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6782 | |
| Uppercase Letter | 1362 | 16.4% |
| Decimal Number | 89 | 1.1% |
| Space Separator | 89 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1335 | |
| r | 743 | |
| t | 743 | |
| n | 659 | |
| i | 592 | |
| d | 570 | |
| u | 570 | |
| o | 570 | |
| e | 262 | 3.9% |
| h | 254 | 3.7% |
| Other values (4) | 484 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 570 | |
| D | 254 | |
| P | 254 | |
| M | 173 | 12.7% |
| C | 89 | 6.5% |
| B | 22 | 1.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 89 |
Space Separator
| Value | Count | Frequency (%) |
| 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8144 | |
| Common | 178 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1335 | |
| r | 743 | |
| t | 743 | |
| n | 659 | |
| i | 592 | 7.3% |
| G | 570 | 7.0% |
| d | 570 | 7.0% |
| u | 570 | 7.0% |
| o | 570 | 7.0% |
| e | 262 | 3.2% |
| Other values (10) | 1530 |
Common
| Value | Count | Frequency (%) |
| 2 | 89 | |
| 89 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1335 | |
| r | 743 | |
| t | 743 | |
| n | 659 | 7.9% |
| i | 592 | 7.1% |
| G | 570 | 6.8% |
| d | 570 | 6.8% |
| u | 570 | 6.8% |
| o | 570 | 6.8% |
| e | 262 | 3.1% |
| Other values (12) | 1708 |
Marital_Status
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| Married | |
|---|---|
| Together | |
| Single | |
| Divorced | |
| Widow | 39 |
| Other values (3) | 4 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.086642599 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7852 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Together |
|---|---|
| 2nd row | Single |
| 3rd row | Married |
| 4th row | Married |
| 5th row | Together |
Common Values
| Value | Count | Frequency (%) |
| Married | 415 | |
| Together | 296 | |
| Single | 234 | |
| Divorced | 120 | 10.8% |
| Widow | 39 | 3.5% |
| Alone | 2 | 0.2% |
| YOLO | 1 | 0.1% |
| Absurd | 1 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| married | 415 | |
| together | 296 | |
| single | 234 | |
| divorced | 120 | 10.8% |
| widow | 39 | 3.5% |
| alone | 2 | 0.2% |
| yolo | 1 | 0.1% |
| absurd | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1363 | |
| r | 1247 | |
| i | 808 | |
| d | 575 | 7.3% |
| g | 530 | 6.7% |
| o | 457 | 5.8% |
| M | 415 | 5.3% |
| a | 415 | 5.3% |
| T | 296 | 3.8% |
| t | 296 | 3.8% |
| Other values (16) | 1450 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6741 | |
| Uppercase Letter | 1111 | 14.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1363 | |
| r | 1247 | |
| i | 808 | |
| d | 575 | |
| g | 530 | 7.9% |
| o | 457 | 6.8% |
| a | 415 | 6.2% |
| t | 296 | 4.4% |
| h | 296 | 4.4% |
| n | 236 | 3.5% |
| Other values (7) | 518 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 415 | |
| T | 296 | |
| S | 234 | |
| D | 120 | 10.8% |
| W | 39 | 3.5% |
| A | 3 | 0.3% |
| O | 2 | 0.2% |
| Y | 1 | 0.1% |
| L | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7852 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1363 | |
| r | 1247 | |
| i | 808 | |
| d | 575 | 7.3% |
| g | 530 | 6.7% |
| o | 457 | 5.8% |
| M | 415 | 5.3% |
| a | 415 | 5.3% |
| T | 296 | 3.8% |
| t | 296 | 3.8% |
| Other values (16) | 1450 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7852 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1363 | |
| r | 1247 | |
| i | 808 | |
| d | 575 | 7.3% |
| g | 530 | 6.7% |
| o | 457 | 5.8% |
| M | 415 | 5.3% |
| a | 415 | 5.3% |
| T | 296 | 3.8% |
| t | 296 | 3.8% |
| Other values (16) | 1450 |
| Distinct | 1031 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52075.80957 |
| Minimum | 1730 |
|---|---|
| Maximum | 162397 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 1730 |
|---|---|
| 5-th percentile | 19544.85 |
| Q1 | 35768.5 |
| median | 51609.5 |
| Q3 | 68325 |
| 95-th percentile | 83834.2 |
| Maximum | 162397 |
| Range | 160667 |
| Interquartile range (IQR) | 32556.5 |
Descriptive statistics
| Standard deviation | 21310.0934 |
|---|---|
| Coefficient of variation (CV) | 0.4092129066 |
| Kurtosis | 0.602284487 |
| Mean | 52075.80957 |
| Median Absolute Deviation (MAD) | 16278.5 |
| Skewness | 0.2916339678 |
| Sum | 57699997 |
| Variance | 454120080.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7500 | 9 | 0.8% |
| 53977 | 2 | 0.2% |
| 70596 | 2 | 0.2% |
| 32892 | 2 | 0.2% |
| 33039 | 2 | 0.2% |
| 60474 | 2 | 0.2% |
| 80124 | 2 | 0.2% |
| 34176 | 2 | 0.2% |
| 38946 | 2 | 0.2% |
| 82800 | 2 | 0.2% |
| Other values (1021) | 1081 |
| Value | Count | Frequency (%) |
| 1730 | 1 | 0.1% |
| 4023 | 1 | 0.1% |
| 4861 | 1 | 0.1% |
| 5305 | 1 | 0.1% |
| 6835 | 1 | 0.1% |
| 7144 | 1 | 0.1% |
| 7500 | 9 | |
| 8940 | 1 | 0.1% |
| 9722 | 1 | 0.1% |
| 10245 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 162397 | 1 | |
| 157733 | 1 | |
| 153924 | 1 | |
| 113734 | 1 | |
| 101970 | 1 | |
| 98777 | 2 | |
| 96876 | 1 | |
| 96843 | 1 | |
| 94871 | 1 | |
| 94642 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 29 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 661 | |
| 1 | 418 | |
| 2 | 29 | 2.6% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 30 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 507 | |
| 2 | 30 | 2.7% |
| Distinct | 536 |
|---|---|
| Distinct (%) | 48.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 28-10-2013 | 7 |
|---|---|
| 03-06-2013 | 7 |
| 10-01-2013 | 6 |
| 20-08-2013 | 6 |
| 31-08-2012 | 6 |
| Other values (531) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 11080 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 201 ? |
|---|---|
| Unique (%) | 18.1% |
Sample
| 1st row | 21-01-2013 |
|---|---|
| 2nd row | 24-05-2014 |
| 3rd row | 08-04-2013 |
| 4th row | 29-03-2014 |
| 5th row | 10-06-2014 |
Common Values
| Value | Count | Frequency (%) |
| 28-10-2013 | 7 | 0.6% |
| 03-06-2013 | 7 | 0.6% |
| 10-01-2013 | 6 | 0.5% |
| 20-08-2013 | 6 | 0.5% |
| 31-08-2012 | 6 | 0.5% |
| 30-12-2012 | 5 | 0.5% |
| 07-08-2013 | 5 | 0.5% |
| 11-05-2013 | 5 | 0.5% |
| 18-05-2014 | 5 | 0.5% |
| 22-08-2012 | 5 | 0.5% |
| Other values (526) | 1051 |
Length
| Value | Count | Frequency (%) |
| 28-10-2013 | 7 | 0.6% |
| 03-06-2013 | 7 | 0.6% |
| 10-01-2013 | 6 | 0.5% |
| 20-08-2013 | 6 | 0.5% |
| 31-08-2012 | 6 | 0.5% |
| 20-02-2013 | 5 | 0.5% |
| 22-05-2013 | 5 | 0.5% |
| 13-01-2013 | 5 | 0.5% |
| 29-05-2014 | 5 | 0.5% |
| 06-05-2013 | 5 | 0.5% |
| Other values (526) | 1051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2467 | |
| - | 2216 | |
| 1 | 2088 | |
| 2 | 2032 | |
| 3 | 890 | 8.0% |
| 4 | 426 | 3.8% |
| 8 | 220 | 2.0% |
| 5 | 204 | 1.8% |
| 9 | 191 | 1.7% |
| 6 | 183 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8864 | |
| Dash Punctuation | 2216 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2467 | |
| 1 | 2088 | |
| 2 | 2032 | |
| 3 | 890 | 10.0% |
| 4 | 426 | 4.8% |
| 8 | 220 | 2.5% |
| 5 | 204 | 2.3% |
| 9 | 191 | 2.2% |
| 6 | 183 | 2.1% |
| 7 | 163 | 1.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11080 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2467 | |
| - | 2216 | |
| 1 | 2088 | |
| 2 | 2032 | |
| 3 | 890 | 8.0% |
| 4 | 426 | 3.8% |
| 8 | 220 | 2.0% |
| 5 | 204 | 1.8% |
| 9 | 191 | 1.7% |
| 6 | 183 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11080 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2467 | |
| - | 2216 | |
| 1 | 2088 | |
| 2 | 2032 | |
| 3 | 890 | 8.0% |
| 4 | 426 | 3.8% |
| 8 | 220 | 2.0% |
| 5 | 204 | 1.8% |
| 9 | 191 | 1.7% |
| 6 | 183 | 1.7% |
| Distinct | 100 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.15613718 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 15 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 25 |
| median | 51 |
| Q3 | 76 |
| 95-th percentile | 94 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 29.08558204 |
|---|---|
| Coefficient of variation (CV) | 0.5799007593 |
| Kurtosis | -1.207803544 |
| Mean | 50.15613718 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | -0.06130958542 |
| Sum | 55573 |
| Variance | 845.9710824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56 | 20 | 1.8% |
| 84 | 18 | 1.6% |
| 81 | 18 | 1.6% |
| 87 | 18 | 1.6% |
| 49 | 17 | 1.5% |
| 3 | 17 | 1.5% |
| 77 | 17 | 1.5% |
| 25 | 17 | 1.5% |
| 51 | 17 | 1.5% |
| 92 | 17 | 1.5% |
| Other values (90) | 932 |
| Value | Count | Frequency (%) |
| 0 | 15 | |
| 1 | 8 | |
| 2 | 11 | |
| 3 | 17 | |
| 4 | 14 | |
| 5 | 8 | |
| 6 | 8 | |
| 7 | 6 | 0.5% |
| 8 | 13 | |
| 9 | 14 |
| Value | Count | Frequency (%) |
| 99 | 11 | |
| 98 | 10 | |
| 97 | 9 | |
| 96 | 13 | |
| 95 | 7 | |
| 94 | 16 | |
| 93 | 12 | |
| 92 | 17 | |
| 91 | 10 | |
| 90 | 12 |
| Distinct | 15 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.339350181 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 27 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.94327979 |
|---|---|
| Coefficient of variation (CV) | 0.8306921326 |
| Kurtosis | 7.729947829 |
| Mean | 2.339350181 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.264245415 |
| Sum | 2592 |
| Variance | 3.776336343 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 2 | 238 | |
| 3 | 143 | 12.9% |
| 4 | 99 | 8.9% |
| 5 | 48 | 4.3% |
| 6 | 29 | 2.6% |
| 0 | 27 | 2.4% |
| 7 | 21 | 1.9% |
| 8 | 9 | 0.8% |
| 10 | 5 | 0.5% |
| Other values (5) | 11 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 27 | 2.4% |
| 1 | 478 | |
| 2 | 238 | |
| 3 | 143 | 12.9% |
| 4 | 99 | 8.9% |
| 5 | 48 | 4.3% |
| 6 | 29 | 2.6% |
| 7 | 21 | 1.9% |
| 8 | 9 | 0.8% |
| 9 | 3 | 0.3% |
| Value | Count | Frequency (%) |
| 15 | 3 | 0.3% |
| 13 | 1 | 0.1% |
| 12 | 1 | 0.1% |
| 11 | 3 | 0.3% |
| 10 | 5 | 0.5% |
| 9 | 3 | 0.3% |
| 8 | 9 | 0.8% |
| 7 | 21 | |
| 6 | 29 | |
| 5 | 48 |
NumWebPurchases
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 14 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.184115523 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 26 |
| Zeros (%) | 2.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.810555731 |
|---|---|
| Coefficient of variation (CV) | 0.6717203947 |
| Kurtosis | 4.924827709 |
| Mean | 4.184115523 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.289607029 |
| Sum | 4636 |
| Variance | 7.899223517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 179 | |
| 1 | 166 | |
| 2 | 165 | |
| 4 | 142 | |
| 5 | 110 | |
| 6 | 95 | |
| 7 | 79 | |
| 8 | 59 | 5.3% |
| 9 | 34 | 3.1% |
| 10 | 27 | 2.4% |
| Other values (4) | 52 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 26 | 2.3% |
| 1 | 166 | |
| 2 | 165 | |
| 3 | 179 | |
| 4 | 142 | |
| 5 | 110 | |
| 6 | 95 | |
| 7 | 79 | |
| 8 | 59 | 5.3% |
| 9 | 34 | 3.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | 0.1% |
| 23 | 1 | 0.1% |
| 11 | 24 | 2.2% |
| 10 | 27 | 2.4% |
| 9 | 34 | 3.1% |
| 8 | 59 | |
| 7 | 79 | |
| 6 | 95 | |
| 5 | 110 | |
| 4 | 142 |
NumCatalogPurchases
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 12 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.690433213 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 279 |
| Zeros (%) | 25.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 9 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.792236397 |
|---|---|
| Coefficient of variation (CV) | 1.037838956 |
| Kurtosis | 0.3807830549 |
| Mean | 2.690433213 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.099499113 |
| Sum | 2981 |
| Variance | 7.796584094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 279 | |
| 1 | 248 | |
| 2 | 141 | |
| 4 | 95 | 8.6% |
| 3 | 83 | 7.5% |
| 5 | 76 | 6.9% |
| 6 | 60 | 5.4% |
| 7 | 37 | 3.3% |
| 10 | 31 | 2.8% |
| 8 | 27 | 2.4% |
| Other values (2) | 31 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 279 | |
| 1 | 248 | |
| 2 | 141 | |
| 3 | 83 | 7.5% |
| 4 | 95 | 8.6% |
| 5 | 76 | 6.9% |
| 6 | 60 | 5.4% |
| 7 | 37 | 3.3% |
| 8 | 27 | 2.4% |
| 9 | 22 | 2.0% |
| Value | Count | Frequency (%) |
| 11 | 9 | 0.8% |
| 10 | 31 | 2.8% |
| 9 | 22 | 2.0% |
| 8 | 27 | 2.4% |
| 7 | 37 | 3.3% |
| 6 | 60 | |
| 5 | 76 | |
| 4 | 95 | |
| 3 | 83 | |
| 2 | 141 |
NumStorePurchases
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 14 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.905234657 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 6 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 12 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.306811786 |
|---|---|
| Coefficient of variation (CV) | 0.5599797431 |
| Kurtosis | -0.758848672 |
| Mean | 5.905234657 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.6536889162 |
| Sum | 6543 |
| Variance | 10.93500419 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 233 | |
| 4 | 163 | |
| 2 | 108 | |
| 5 | 107 | |
| 6 | 80 | 7.2% |
| 8 | 73 | 6.6% |
| 7 | 67 | 6.0% |
| 10 | 60 | 5.4% |
| 12 | 59 | 5.3% |
| 9 | 57 | 5.1% |
| Other values (4) | 101 |
| Value | Count | Frequency (%) |
| 0 | 6 | 0.5% |
| 1 | 4 | 0.4% |
| 2 | 108 | |
| 3 | 233 | |
| 4 | 163 | |
| 5 | 107 | |
| 6 | 80 | 7.2% |
| 7 | 67 | 6.0% |
| 8 | 73 | 6.6% |
| 9 | 57 | 5.1% |
| Value | Count | Frequency (%) |
| 13 | 41 | 3.7% |
| 12 | 59 | 5.3% |
| 11 | 50 | 4.5% |
| 10 | 60 | 5.4% |
| 9 | 57 | 5.1% |
| 8 | 73 | |
| 7 | 67 | |
| 6 | 80 | |
| 5 | 107 | |
| 4 | 163 |
| Distinct | 15 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.348375451 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 5 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.405114821 |
|---|---|
| Coefficient of variation (CV) | 0.4496907226 |
| Kurtosis | 2.346720892 |
| Mean | 5.348375451 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.2990003964 |
| Sum | 5926 |
| Variance | 5.784577304 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 188 | |
| 6 | 177 | |
| 8 | 169 | |
| 5 | 139 | |
| 3 | 112 | |
| 4 | 109 | |
| 2 | 94 | |
| 1 | 67 | 6.0% |
| 9 | 42 | 3.8% |
| 0 | 5 | 0.5% |
| Other values (5) | 6 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.5% |
| 1 | 67 | 6.0% |
| 2 | 94 | |
| 3 | 112 | |
| 4 | 109 | |
| 5 | 139 | |
| 6 | 177 | |
| 7 | 188 | |
| 8 | 169 | |
| 9 | 42 | 3.8% |
| Value | Count | Frequency (%) |
| 20 | 2 | 0.2% |
| 19 | 1 | 0.1% |
| 14 | 1 | 0.1% |
| 13 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| 9 | 42 | 3.8% |
| 8 | 169 | |
| 7 | 188 | |
| 6 | 177 | |
| 5 | 139 |
AcceptedCmp3
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 77 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1031 | |
| 1 | 77 | 6.9% |
AcceptedCmp4
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 95 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1013 | |
| 1 | 95 | 8.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 80 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| 1 | 80 | 7.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 76 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1032 | |
| 1 | 76 | 6.9% |
AcceptedCmp2
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 17 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1091 | |
| 1 | 17 | 1.5% |
Complain
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 | 10 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1098 | |
| 1 | 10 | 0.9% |
Response
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.8 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1108 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1108 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 951 | |
| 1 | 157 | 14.2% |
| Distinct | 694 |
|---|---|
| Distinct (%) | 62.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 617.1218412 |
| Minimum | 6 |
|---|---|
| Maximum | 2525 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.8 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 70.75 |
| median | 412 |
| Q3 | 1068.75 |
| 95-th percentile | 1786.55 |
| Maximum | 2525 |
| Range | 2519 |
| Interquartile range (IQR) | 998 |
Descriptive statistics
| Standard deviation | 603.5879717 |
|---|---|
| Coefficient of variation (CV) | 0.978069372 |
| Kurtosis | -0.4454106947 |
| Mean | 617.1218412 |
| Median Absolute Deviation (MAD) | 367 |
| Skewness | 0.817892666 |
| Sum | 683771 |
| Variance | 364318.4395 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 46 | 13 | 1.2% |
| 22 | 11 | 1.0% |
| 48 | 8 | 0.7% |
| 15 | 8 | 0.7% |
| 37 | 8 | 0.7% |
| 57 | 7 | 0.6% |
| 41 | 7 | 0.6% |
| 20 | 7 | 0.6% |
| 45 | 7 | 0.6% |
| 38 | 7 | 0.6% |
| Other values (684) | 1025 |
| Value | Count | Frequency (%) |
| 6 | 2 | 0.2% |
| 8 | 2 | 0.2% |
| 9 | 1 | 0.1% |
| 10 | 3 | 0.3% |
| 11 | 1 | 0.1% |
| 13 | 3 | 0.3% |
| 14 | 1 | 0.1% |
| 15 | 8 | |
| 16 | 4 | |
| 17 | 6 |
| Value | Count | Frequency (%) |
| 2525 | 1 | |
| 2440 | 1 | |
| 2302 | 1 | |
| 2279 | 1 | |
| 2257 | 1 | |
| 2252 | 1 | |
| 2217 | 1 | |
| 2211 | 1 | |
| 2194 | 1 | |
| 2153 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| id | Year_Birth | Education | Marital_Status | Income | Kidhome | Teenhome | Dt_Customer | Recency | NumDealsPurchases | NumWebPurchases | NumCatalogPurchases | NumStorePurchases | NumWebVisitsMonth | AcceptedCmp3 | AcceptedCmp4 | AcceptedCmp5 | AcceptedCmp1 | AcceptedCmp2 | Complain | Response | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1974 | Master | Together | 46014.0 | 1 | 1 | 21-01-2013 | 21 | 10 | 7 | 1 | 8 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 541 |
| 1 | 1 | 1962 | Graduation | Single | 76624.0 | 0 | 1 | 24-05-2014 | 68 | 1 | 5 | 10 | 7 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 899 |
| 2 | 2 | 1951 | Graduation | Married | 75903.0 | 0 | 1 | 08-04-2013 | 50 | 2 | 6 | 6 | 9 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 901 |
| 3 | 3 | 1974 | Basic | Married | 18393.0 | 1 | 0 | 29-03-2014 | 2 | 2 | 3 | 0 | 3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| 4 | 4 | 1946 | PhD | Together | 64014.0 | 2 | 1 | 10-06-2014 | 56 | 7 | 8 | 2 | 5 | 7 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 444 |
| 5 | 5 | 1952 | Graduation | Single | 47958.0 | 0 | 1 | 19-01-2013 | 8 | 2 | 6 | 3 | 5 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 407 |
| 6 | 6 | 1971 | Graduation | Single | 22804.0 | 1 | 0 | 31-07-2013 | 75 | 1 | 2 | 0 | 2 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| 7 | 7 | 1978 | Graduation | Widow | 54162.0 | 1 | 1 | 18-03-2013 | 31 | 1 | 1 | 0 | 3 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| 8 | 8 | 1968 | Graduation | Married | 45688.0 | 0 | 1 | 25-01-2014 | 20 | 2 | 3 | 1 | 8 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 306 |
| 9 | 9 | 1952 | Graduation | Single | 61823.0 | 0 | 1 | 18-02-2013 | 26 | 4 | 8 | 2 | 10 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 884 |
Last rows
| id | Year_Birth | Education | Marital_Status | Income | Kidhome | Teenhome | Dt_Customer | Recency | NumDealsPurchases | NumWebPurchases | NumCatalogPurchases | NumStorePurchases | NumWebVisitsMonth | AcceptedCmp3 | AcceptedCmp4 | AcceptedCmp5 | AcceptedCmp1 | AcceptedCmp2 | Complain | Response | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1098 | 1098 | 1970 | PhD | Married | 23626.0 | 1 | 0 | 24-05-2014 | 84 | 3 | 3 | 1 | 3 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 43 |
| 1099 | 1099 | 1973 | PhD | Married | 85844.0 | 0 | 0 | 29-05-2014 | 62 | 1 | 6 | 6 | 7 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1958 |
| 1100 | 1100 | 1956 | Master | Single | 55284.0 | 0 | 1 | 24-12-2012 | 60 | 3 | 7 | 5 | 8 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 764 |
| 1101 | 1101 | 1971 | Master | Divorced | 42835.0 | 1 | 1 | 30-06-2013 | 64 | 7 | 6 | 6 | 4 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 595 |
| 1102 | 1102 | 1946 | PhD | Single | 82800.0 | 0 | 0 | 24-11-2012 | 23 | 1 | 7 | 6 | 12 | 3 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1315 |
| 1103 | 1103 | 1956 | Graduation | Together | 46097.0 | 0 | 1 | 31-03-2013 | 11 | 5 | 3 | 1 | 6 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 241 |
| 1104 | 1104 | 1986 | Graduation | Married | 23477.0 | 1 | 0 | 21-10-2013 | 39 | 3 | 3 | 0 | 4 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 147 |
| 1105 | 1105 | 1975 | Master | Married | 37368.0 | 1 | 0 | 16-12-2013 | 4 | 1 | 1 | 0 | 2 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| 1106 | 1106 | 1974 | Graduation | Divorced | 53034.0 | 1 | 1 | 30-05-2013 | 30 | 8 | 6 | 1 | 7 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 447 |
| 1107 | 1107 | 1952 | PhD | Divorced | 46610.0 | 0 | 2 | 29-10-2012 | 8 | 6 | 4 | 1 | 6 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 302 |